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Abstract 

We construct a hierarchy of regular languages such that the cur- 
rent language in the hierarchy can be accepted by 1-way quantum finite 
automata with a probability smaller than the corresponding probabil- 
ity for the preceding language in the hierarchy. These probabilities 
converge to ^ . 
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1 Introduction 



Quantum computation is a most challenging project involving research both 
by physicists and computer scientists. The principles of quantum compu- 
tation differ from the principles of classical computation very much. The 
classical computation is based on classical mechanics while quantum com- 
putation attempts to exploit phenomena specific to quantum physics. 

One of features of quantum mechanics is that a quantum process can be 
in a combination (called superposition) of several states and these several 
states can interact one with another. A computer scientist would call this a 
massive parallelism. This possibility of massive parallelism is very important 
for Computer Science. In 1982, Nobel prize winner physicist Richard Feyn- 
man (1918-1988) asked what effects the principles of quantum mechanics 
can have on computation | Fe 82 1. An exact simulation of quantum processes 
demands exponential running time. Therefore, there may be other compu- 
tations which are performed nowadays by classical computers but might be 
simulated by quantum processes in much less time. 

R.Feynman's influence was (and is) so high that rather soon this possibil- 
ity was explored both theoretically and practically. David Deutsch[ De 89| ] 
introduced quantum Turing machines, quantum physical counterparts of 
probabilistic Turing machines. He conjectured that they may be more ef- 
ficient that classical Turing machines. He also showed the existence of a 
universal quantum Turing machine. This construction was subsequently 
improved by Bernstein and Vazirani [BV 97] and Yao |Ya 93]. 

Quantum Turing machines might have remained relatively unknown but 
two events caused a drastical change. First, Peter Shor [ ^h 97 ] invented 
surprising polynomial-time quantum algorithms for computation of discrete 
logarithms and for factorization of integers. Second, unusual quantum cir- 
cuits having no classical counterparts (such as quantum bit teleportation) 
have been physically implemented. Hence, there is a chance that universal 
quantum computers may be built. Moreover, since the modern public-key 
cryptography is based on intractability of discrete logarithms and factor- 
ization of integers, building a quantum computer implies building a code- 
breaking machine. 

In this paper, we consider quantum finite automata (QFAs), a different 
model of quantum computation. This is a simpler model than quantum 
Turing machines and and it may be simpler to implement. 

Quantum finite automata have been studied in ] AF 98 , [BP 9£ , KW 97 , 
MC 97]. Surprisingly, QFAs do not generalize deterministic finite automata. 
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Their capabilities are incomparable. QFAs can be exponentially more space- 
efficient |^F|^9^ . However, there are regular languages that cannot be rec- 
ognized by quantum finite automata[ |KW 97 1. 

This weakness is caused by reversibility. Any quantum computation 
is performed by means of unitary operators. One of the simplest proper- 
ties of these operators shows that such a computation is reversible. The 
result always determines the input uniquely. It may seem to be a very 
strong limitation. Luckily, for unrestricted quantum algorithms (for in- 
stance, for quantum Turing machines) this is not so. It is possible to embed 
any irreversible computation in an appropriate environment which makes it 



reversible [ Be 89 1. For instance, the computing agent could keep the inputs 
of previous calculations in successive order. Quantum finite automata are 
more sensitive to the reversibility requirement. 

If the probability with which a QFA is required to be correct decreases, 
the set of languages that can be recognized increases. In particular] AF 9S ], 
there are languages that can be recognized with probability 0.68 but not 
with probability 7/9. In this paper, we extend this result by constructing a 
hierarchy of languages in which each next language can be recognized with 
a smaller probability than the previous one. 



2 Preliminaries 

2.1 Basics of quantum computation 

To explain the difference between classical and quantum mechanical world, 
we first consider one-bit systems. A classical bit is in one of two classical 
states true and false. A probabilistic counterpart of the classical bit can be 
true with a probability a and false with probability /3, where a + (3 = 1. 
A quantum bit (qubit) is very much like to it with the following distinction. 
For a qubit a and /3 can be arbitrary complex numbers with the property 
+ II/^IP = 1- If observe a qubit, we get true with probability ||a|p 
and false with probability just like in probabilistic case. However, 

if we modify a quantum system without observing it (we will explain what 
this means), the set of transformations that one can perform is larger than 
in the probabilistic case. This is where the power of quantum computation 
comes from. 

More generally, we consider quantum systems with m basis states. We 
denote the basis states \qi), \q2), ■ ■ ., \qm)- Let V' be a linear combination of 
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them with complex coefhcients 

ip = ai \qi) + a2 \q2) + ■ ■ ■ + a. 
The I2 norm of ■0 is 



aiP + |a2P + . .. + \ar, 



The state of a quantum system can be any ■0 with \\ip\\ = 1. ■0 is caUed 
a superposition of «i) Om are cahed amplitudes of 

. . ., I^m)- We use hiQ) to denote the vector space consisting of aU Hnear 
combinations of . . ., \qm)- 

AUowing arbitrary complex amplitudes is essential for physics. However, 
it is not important for quantum computation. Anything that can be com- 
puted with complex amplitudes can be done with only real amplitudes as 
well. This was shown for quantum Turing machines in | BV 93| ]'] and the 



same proof works for QFAs. However, it is important that negative am- 
plitudes are allowed. For this reason, we assume that all amplitudes are 
(possibly negative) reals. 

There are two types of transformations that can be performed on a quan- 
tum system. The first type are unitary transformations. A unitary transfor- 
mation is a linear transformation U on hiQ) that preserves I2 norm. (This 
means that any ip with \\ip\\ = 1 is mapped to ^' with = 1.) 

Second, there are measurements. The simplest measurement is observing 
ip = ai \qi) + 02 1^2) + ■■■ + am \qm) in the basis \qi) , . . . , \qm)- It gives 
\qi) with probability q?. (||'(/'|| = 1 guarantees that probabilities of different 
outcomes sum to 1.) After the measurement, the state of the system changes 
to l^i) and repeating the measurement gives the same state \qi). 

In this paper, we also use partial measurements. Let Qi, ■ ■ ■ ,Qk be 
pairwise disjoint subsets of Q such that Qi U Q2 D . . . U Qk = Q- Let Ej, 
for j £ {1, • • ■ ,k}, denote the subspace of hiQ) spanned by \qj), j £ Qi. 
Then, a partial measurement w.r.t. Ei, . . . , gives the answer ip G Ej with 
probability J^ieQj ctf • After that, the state of the system collapses to the 
projection of ip to Ej. This projection is ipj = J2i£Qj '^i !%)• 

2.2 Quantum finite automata 

Quantum finite automata were introduced twice. First this was done by C. 
Moore and J.P.Crutchfield |MC 97] ]. Later in a different and non-equivalent 



way these automata were introduced by A. Kondacs and J. Watrous |KW97] 



^For unknown reason, this proof does not appear in [BV 97] 
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The first definition just mimics the definition of 1-way probabihstic finite 
automata only substituting stochastic matrices by unitary ones. We use a 
more elaborated definition pW 97 ], 



A QFA is a tuple M = {Q;T,;V; qq; Qacc', Qrej) where Q is a finite set 
of states, S is an input alphabet, F is a transition function, qo G Q is 
a starting state, and Qacc C Q and Qrej C Q are sets of accepting and 
rejecting states. The states in Qacc and Qrej are called halting states and 
the states in Qnon = Q — {Qacc U Qrej) are called non halting states, k and 
$ are symbols that do not belong to S. We use k and $ as the left and the 
right endmarker, respectively. The working alphabet of M is F = S U {k; $}. 

The transition function V isa mapping from T x hiQ) to hiQ) such that, 
for every a G F, the function Va : hiQ) h{Q) defined by Va{x) = ^(a, x) 
is a unitary transformation. 

The computation of a QFA starts in the superposition \qQ). Then trans- 
formations corresponding to the left endmarker k, the letters of the input 
word X and the right endmarker $ are applied. The transformation corre- 
sponding to a € F consists of two steps. 

1. First, Va is applied. The new superposition if)' is V^(V') where if) is 
the superposition before this step. 

2. Then, iIj' is observed with respect to Eacc, Erej , Enon. where Eacc = 
span{\q) : q £ Qacc}, Erej = span{\q) : q £ Qrej}, Enon = span{\q) : q £ 
Qnon} (see section 

If we get ip' E Eacc, the input is accepted. If we get ip' E Erej, the input 
is rejected. If we get ijj' E Enon, the next transformation is applied. 

We regard these two transformations as reading a letter a. We use to 
denote the transformation consisting of Va followed by projection to Enon- 
This is the transformation mapping ^ to the non-halting part of Va{ip). We 
use tpy to denote the non-halting part of QFA's state after reading the left 
endmarker k and the word y E S*. 

We compare QFAs with different probabilities of correct answer. This 



problem was first considered by A. Ambainis and R. Freivaldsj AF_98|. The 
following theorems were proved there: 

Theorem 2.1 Let L be a language and M be its minimal automaton. As- 
sume that there is a word x such that M contains states qi, q2 satisfying: 

1- qi + (12, 

2. If M starts in the state qi and reads x, it passes to q2, 
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3. If M starts in the state q2 and reads x, it passes to q2, and 
4- q2 is neither "all- accepting" state, nor "all-rejecting" state. 

Then L cannot be recognized by a 1-way quantum finite automaton with 
probability 7/9 + e for any fixed e > 0. 

Theorem 2.2 Let L be a language and M be its minimal automaton. If 
there is no qi,q2,x satisfying conditions of Theorem \2.\ then L can be rec- 
ognized by a 1-way reversible finite automaton (i.e. L can be recognized by 
a 1-way quantum finite automaton with probability 1). 

Theorem 2.3 The language a*b* can be recognized by a 1-way QFA with 
the probability of correct answer p = 0.68... where p is the root of p^ +p = 1. 

Corollary 2.1 There is a language that can be recognized by a 1-QFA with 
probability 0.68... but not with probability 7/9 + e. 

For probabilistic automata, the probabiUty of correct answer can be 
increased arbitrarily and this property of probabilistic computation is con- 
sidered as evident. Theorems above show thatits counterpart is not true in 
the quantum world! The reason for that is that the model of QFAs mixes 
reversible (quantum computation) components with nonreversible (measure- 
ments after every step). 

In this paper, we consider the best probabilities of acceptance by 1-way 
quantum finite automata the languages a*b* . . . z* . Since the reason why the 
language a*b* cannot be accepted by 1-way quantum finite automata is the 



property described in the Theorems 2.1 and 2.2, this new result provides an 



insight on what the hierarchy of languages with respect to the probabilities 
of their acceptance by 1-way quantum finite automata may be. We also 



show a generalization of Theorem 2.3 in a style similar to Theorem 2.2. 

3 Main results 

Lemma 3.1 For arbitrary real xi > 0, X2 > 0, Xn > 0, there exists a 
unitary n x n matrix Mn{xi, X2, ■■■,Xn) with elements ruij such that 

Xi X2 Xn 

mil = I "121 = , ■■■,rnni 



ixl + ... + xl Jxl + ... + xl Jxl + ... + xl 
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□ 

Let Ln be the language a^a2-..ajj. 
Theorem 3.1 The language Ln (n> 1) can be recognized by a l-way QFA 

n + l 

with the probability of correct answer p where p is the root of p"-^ + p = 1 
in the interval [1/2, 1]. 

Proof: Let ruij be the elements of the matrix M^.(2;i, X2, 2;^) from Lemma 
3.1. We construct a k x (k — 1) matrix Tk{xi,X2, with elements tij = 

mjj+i. Let Rk{xi, X2, ■■■,Xk) he a k x k matrix with elements rij = aT' ^i. i 
and Ik be the k x k identity matrix. 
For fixed n, let p„ E [1/2, 1] satisfy p. 



n+l 
n-1 



+ Pn = 1 and Pk ^ k < n) 



fc-l k 

Pn''^ — Pn~^ ■ It is easy to see that pi + P2 + ■■■ + Pn = ^ and 



PnjPk + •■• +Pn 
{Pk-l + ... +Pn 



2(fc-l) 

2(fc-2) 
Pn 



n + l 

Pn 



■ Pn 



Now we describe a l-way QFA accepting the language L„. 

The automaton has 2n states: qi, q2, ■■■ Qn are non halting states, 
Qn+i, Qn+2, ■■■ Q2n~i rejecting states and q2n is an accepting state. The 
transition function is defined by unitary block matrices 
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In-l 

\ 1 y 




Case 1. The input is Kolal-.-a* $. 

The starting superposition is After reading the left cndmarkcr the 
superposition becomes .Jpi \qi) + \q2) + . . . + ^Jp^ \qn) and after reading 
a\ the superposition remains the same. 

If the input contains then reading the first Ofc changes the non-halting 
part of the superposition to y/pk\<lk) + ■ • • + ^/Pn\(ln) and after reading all 
the rest of the non-halting part of the superposition remains the same. 

Reading the right endmarker maps \cin) to \Q2n) • Therefore, the super- 
position after reading it contains \q2n)- This means that the automaton 
accepts with probability p„ because g2n is an accepting state. 



Case 2. The input is Ka\a2---a*}.akam--- (k > m). 

After reading the last the non-halting part of the superposition is 

^/Pk\qk) 

-|- . . . -|- Y^Pn kn)- Then reading am changes the non-halting part to 



(Pm + ---+Pn) 



^(pfc + ...+Pn) 



-, \ — \ — N— \qn) ■ This means that the automaton 

accepts with probability < Py^^''2"'^^''yi and rejects with probability at least 

\Pm~T---~rPn) 



^ _ PnjPk + - +Pnf Pn{Pk + - + Pn)^ 



(Pm + - +Pn)^ 

that follows from (1). □ 



-I- ... +Pn)^ 



-Pr, 



Corollary 3.1 The language Ln can be recognized by a 1-way QFA with the 
probability of correct answer at least ^ + f , for a constant c. 



Proof: By resolving the equation = 1, we get p = ^ + ©(^). □ 
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Theorem 3.2 The language cannot he recognized by a 1-way QFA with 
probability greater than p where p is the root of 



n — 1 y n — 1 

in the interval [1/2, 1]. 

Proof: Assume we are given a 1-way QFA M. We show that, for any e > 0, 
there is a word such that the probabihty of correct answer is less than p + e. 



Lemma 3.2 JjAF 9S{ ] Let X £ S"*". There are subspaces Ei, E2 such that 



Enon = El® E2 and 

(i) Ifi^eEi, then V^{-iP) G Ei, 

(ii) If Tp G E2, then ||V^fe(V')|| ~^ when k — > 00. 

We use n — 1 such decompositions: for x = 02, x = 03, . . ., x = a„. The 
subspaces Ei, E2 corresponding to x = are denoted and £'m,2- 

Let m G {2, . . . , n}, y £ ala2 ■ ■ ■ Om-i- Remember that ipy denotes the 
superposition after reading y (with observations w.r.t. Enon © Eacc © Erej 
after every step). We express ipy as ipy + tpy, Tpy e Em,i, ip^ G Em,2- 

Case 1. \\ipy\\ < \f^^E^ for some m G {2, . . . , n} and y G a^^ . . . Qm-i- 
Let i > 0. Then, yom-i G Ln but ya^Om-i ^ Ln- Consider the dis- 
tributions of probabihties on M's answers "accept" and "reject" on yom-i 
and ya^Om-i- If M recognizes Ln with probabihty p + e, it must accept 
yoni-i with probabihty at least p + e and reject it with probability at most 
1—p — e. Also, yalnCLm-i must be rejected with probability at least p + e and 
accepted with probability at most 1—p—e. Therefore, both the probabilities 
of accepting and the probabilities of rejecting must differ by at least 

(p + e) - {1-p- e) = 2p-l + 2e. 

This means that the variational distance between two probability distribu- 
tions (the sum of these two distances) must be at least 2(2p — 1) + 4e. We 
show that it cannot be so large. 

First, we select an appropriate i. Let k be so large that \\V'k (V'«)ll ^ ^ 

for S = e/4. ^y,V^^{ipy),V^2 (ipl), ... is a bounded sequence in a finite- 
dimensional space. Therefore, it has a limit point and there are i,j such 
that 
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We choose i,j so that i > k. 

The difference between the two probabihty distributions comes from two 
somxes. The first source is the difference between ^py and tpya^^ (the states of 
M before reading Om-i)- The second source is the possibility of M accepting 
while reading (the only part that is different in the two words). We bound 
each of them. 

The difference ipy — ipy^i can be partitioned into three parts. 

i^y - = ii^y - i^l) + ii^l - V:.Ji^l)) + K^Ji^l) - %,^). (3) 

The first part is ipy — ij)y = ipy and \\ipy\\ < \f^^^-- The second and 
the third parts are both small. For the second part, notice that V^^ is 
unitary on Em,i (because Va^ is unitary and Va^ (ip) does not contain halting 
components for tp £ Em,i)- Hence, V^^ preserves distances on Em,i and 

IIV'^ - K» (V'^)ll = \K. {^D - v\^Ml)\\ < s 

y "■m y am ^ CLm ^ 

For the third part of (S), remember that ij^^Q^i = V'i (ipy)- Therefore, 
and llV^f^i II ^ because i > k. Putting all three parts together, we get 



Lemma 3.3 IBV 91 ] Let ip and cj) he such that ||'0|| < 1, ||</'|| < 1 and 



\\''P~4'\\ ^ Then the total variational distance resulting from measurements 
of (f) and ^ is at most 4e. 

This means that the difference between any probability distributions 
generated by ipy and ipyai^ is at most 



y n — 1 

In particular, this is true for the probability distributions obtained by ap- 
plying V$ and the corresponding measurements to ipy and tpyai^- 
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The probability of M halting while reading is at most ~ 



n-l 



Adding it increases the variational distance by at most . Hence, the 

total variational distance is at most 




2(1 -p) ^ 2(l-p) 
n — 1 y n — 1 

By definition of p, this is the same as {2p — 1) + 2e. However, if M dis- 
tinguishes y and yal^ correctly, the variational distance must be at least 
(2p — 1) + 4e. Hence, M does not recognize one of these words correctly. 

Case 2. HV'yll > \f^^^^ fo^' every m G {2, . . . ,n} and y £ a\. . .a^_]^. 
We define a sequence of words yi, • • • , ym £ Oi • • • «n- Let yi = ai and 
yk = yk-ia^ for A; G {2, . . . , n} where is such that 



The existence of is guaranteed by (ii) of Lemma 3.2. 

We consider the probability that M halts on y„ = aiag^ag^ • • • ^n" before 
seeing the right endmarker. Let k E {2, ...,n}. The probability of M 
halting while reading the part of yn is at least 

yfc 1 yfc 1 n — 1 n — 1 

By summing over all /c G {2, . . . ,n}, the probability that M halts on y„ is 
at least 

\n— 1 71—1/ 

This is the sum of the probability of accepting and the probability of reject- 
ing. Hence, one of these two probabilities must be at least (1 — p) — e/2. 
Then, the probability of the opposite answer on any extension of y„ is at 
most 1 — {1 — p — e/2) = p + e/2. However, y„ has both extensions that 
are in L„ and extensions that are not. Hence, one of them is not recognized 
with probability p + e. □ 

By solving the equation (|2|), we get 

Corollary 3.2 L„ cannot he recognized with probability greater than k + 
3 
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Proof: The right-hand side of (^) is at most + 4y because p > 1/2 
and, hence, 1 — p < 1/2. This imphes 



2p - 1 < — h 4 



n — 1 y n — 1^ 



11 1 11 
P < - + 2W + < - + 3^ 



2 \j n-1 2(n-l)~2 V"-l 
and L„ cannot be recognized with probabihty greater than p by Theorem 



3.2 



□ 

Let ni = 2 and = — + 1 for k > 1 (where c is the constant from 



Theorem 3A_). Also, define pfc = ^ + ;^- Then, Corollaries 3T and tj.2| imply 



Theorem 3.3 For every > 1, Lrij^ can be recognized with by a l-way QFA 
with the probability of correct answer pk but cannot be recognized with the 
probability of correct answer pk~i- 



Proof: By Corollary 3.1, L„^, can be recognized with probability ^ + ^ = 
Pk- 

On the other hand, by Corollary |3.2| , Ln^ cannot be recognized with prob- 
ability ^ + ■ The definition of implies n/j — 1 = , \/nk — 1 = 

3"fc-i 
c ' 

13 1 c 

o + / 1 = o + = Pk-l- 

□ 

Thus, we have constructed a sequence of languages -L„^, L„2, • • • such 
that, for each Ln,., the probability with which Ln^. can be recognized by a 
l-way QFA is smaller than for L„j,_^. 

Our final theorem is a counterpart of Theorem It generalizes Theo- 
rem 2.3. 



Theorem 3.4 Let L be a language and M be its minimal automaton. If 
there is no qi,q2,Q3,x,y such that 

1. the states qi,q2,q3 are pairwise different, 

2. If M starts in the state qi and reads x, it passes to q2, 

3. If M starts in the state q2 and reads x, it passes to q2, and 
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4- If M starts in the state q2 and reads y, it passes to q^, 

5. If M starts in the state and reads y, it passes to q^, 

6. hath q2 and q^ are neither "all- accepting" state, nor "all-rejecting" 
state, 

then L can be recognized by a 1-way quantum finite automaton with proba- 
bility p = 0.68.... 
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